Tencent has released the HY-Embodied-0.5 foundational model specifically designed for robots, aiming to address the shortcomings of general visual language models in 3D spatial perception and physical interaction, and to advance large models toward the field of robot control. The series of models have been restructured in both architecture and training, and are accompanied by the release of main models such as MoT-2B.
The Google Gemini AI chatbot now includes interactive 3D models and simulation features, helping users learn scientific concepts through dynamic visualization. Users simply need to issue a command to generate interactive 3D content and visual charts, presenting complex topics in a dynamic format, surpassing traditional text and static diagrams.
Google's AI assistant Gemini now features interactive 3D models and dynamic simulations, allowing users to explore spatial structures and physical laws through rotatable, scalable visualizations like moon orbits or double pendulums, with adjustable variables for enhanced understanding.....
Google Gemini introduces a new feature that generates interactive 3D models and physics simulation scenarios, bridging the gap from text-based answers to intuitive teaching. When users ask questions related to physics or 3D space, the AI provides a dynamic window that supports free dragging and multi-dimensional perspective adjustments, enhancing the interactive experience.
An AI image-to-3D model tool developed by Microsoft. It can generate high-quality 3D models in 1 - 3 minutes and supports multiple software.
Kreat3D is an AI-driven 3D model creation platform that can quickly convert images and text into 3D models.
Use AI to convert text and images into 3D models, suitable for AR experiences, product visualization, etc.
Hunyuan 3D AI converts text and images into high-quality 3D models with PBR textures without the need for modeling experience.
Google
$0.49
Input tokens/M
$2.1
Output tokens/M
1k
Context Length
Openai
$2.8
$11.2
Xai
$1.4
$3.5
2k
$7.7
$30.8
200
-
Anthropic
$105
$525
$0.7
$7
$35
$17.5
$21
Alibaba
$2
$20
$4
$16
Baidu
128
$6
$24
256
$1
$10
ImrozeAslamMalik
LGM is an integrated image-to-3D workflow incorporating multi-view diffusion models, capable of generating high-quality 3D content from a single image.
MonsterMMORPG
The TRELLIS image-conditioned version is a large-scale 3D generation model capable of generating corresponding 3D models from input 2D images.
VAST-AI
TripoSG-scribble is an AI tool that rapidly generates 3D models from scribble images and text prompts. As a variant of TripoSG, it is suitable for creative design and rapid prototyping.
Stable-X
An improved version of TRELLIS that supports converting 2D images into 3D models, with special support for normal conditioning.
homebrewltd
AlphaSpace is an innovative approach designed to enhance the spatial reasoning capabilities of language models for robotic manipulation in 3D Cartesian space.
Menlo
AlphaSpace is an innovative method that enhances language models' spatial reasoning capabilities for robotic manipulation in 3D Cartesian space.
TrianC0de
TripoSR is a fast feed-forward 3D generation model developed collaboratively by Stability AI and Tripo AI, capable of rapidly generating 3D models from a single image.
zhang3z
dust3r is a deep learning model for generating 3D models from images, supporting multi-view 3D reconstruction.
IvanTang
ENEL is a model exploring the potential of encoder-free architecture in 3D large multimodal models.
stanfordmimi
A family of medical image processing models consisting of six large-scale, generalizable 2D/3D variational autoencoders capable of encoding medical images into compressed latent representations and achieving high-fidelity image reconstruction.
craftsman3d
CraftsMan is a high-fidelity mesh generation system based on native 3D generation and interactive geometry optimization, capable of generating high-quality 3D mesh models from a single image.
WizWhite
A LoRA model for generating paper miniature models, specializing in creating flat cardboard scenes and 3D paper objects with a vintage style.
facebook
VFusion3D is a large-scale feed-forward 3D generation model trained with limited 3D data and extensive synthetic multi-view data, representing the first work to explore scalable 3D generation/reconstruction models.
jadechoghari
VFusion3D is a large-scale feed-forward 3D generation model trained with limited 3D data and extensive synthetic multi-view data, exploring scalable 3D generation/reconstruction models.
dylanebert
LGM is a high-resolution 3D content creation pipeline integrating multi-view diffusion models, specifically designed for 3D machine learning courses.
naver
DUSt3R is a deep learning model for generating 3D geometric models from images, capable of easily handling geometric 3D vision tasks.
Yiwen-ntu
MeshAnything is an artist-grade mesh generation model based on autoregressive Transformers, capable of converting images or point clouds into high-quality 3D mesh models.
GoodBaiBai88
M3D is a 3D medical image analysis technology based on multimodal large language models, including the M3D-Data dataset, M3D-LaMed model, and M3D-Bench evaluation benchmark.
zxhezexin
OpenLRM is an open-source implementation of the LRM paper for generating 3D models from a single image
OpenLRM is an open-source implementation of the LRM paper, capable of generating 3D models from a single image, with multiple versions of different scales.
Blender MCP VXAI is a powerful integration tool that allows users to control Blender through natural language to create and modify 3D models, animations, and scenes. It simplifies complex operations and supports real-time export to projects.
FreeCAD MCP is a plugin for controlling FreeCAD through Claude Desktop, supporting various design functions such as creating 3D models from 2D drawings.
An MCP server based on OpenSCAD that generates multi - view images through AI and reconstructs them into parametric 3D models, supporting remote CUDA - accelerated processing.
The OpenSCAD MCP Server is a tool for generating parametric 3D models through text or images, supporting multi-view reconstruction and remote processing.
The OpenSCAD MCP Server is a service for generating parametric 3D models from text or images. It supports multi - view reconstruction, AI image generation, remote CUDA processing, and workflow approval, and finally outputs OpenSCAD - compatible model files.
Trellis MCP is an interface service that connects AI assistants with Trellis 3D generation models, supporting rapid generation of 3D assets through natural language and importing them into Blender. This project is based on an open - source model and requires self - deployment of the API backend. It is fast and free, but there are stability risks.
The game asset generator uses AI models and the MCP protocol to quickly generate 2D and 3D game resources through text prompts.
An open - source project that integrates Blender with local AI models to control 3D modeling through natural language.
The MCP STL 3D Relief Generator is a tool that converts 2D images into 3D relief models, supporting functions such as controlling model size, adding a base, and depth inversion. It is suitable for 3D printing and rendering.
Ludo AI MCP Server is a service that provides AI - generated game assets (such as images, 3D models, animations, and audio) through the Model Context Protocol (MCP), supporting integration with clients such as Claude Desktop and Cursor.
The Meshy AI MCP Server is a model context protocol server for interacting with the Meshy AI API, providing functions such as generating 3D models from text and images, applying textures, and remeshing models.
The Poly.Pizza MCP server is a tool for directly importing free low - poly 3D models into Unity projects, supporting model search, batch import, automatic prefab generation, and copyright information recording.
An MCP server for processing, validating, optimizing, and analyzing 3D models (supporting glTF/GLB formats), providing functions such as model analysis, format conversion, compression, and texture optimization
An MCP server for interacting with the Sketchfab 3D model platform, supporting functions such as searching, viewing details, and downloading 3D models.